Matrix correlations for high-dimensional data: the modified RV-coefficient
Authors
Abstract
Motivation: Modern functional genomics generates high-dimensional datasets. It is often convenient to have a single simple number that comprehensively characterizes the relationship between a pair of such datasets. Matrix correlations are such numbers, and they are appealing because they can be interpreted in the same way as the Pearson correlations familiar to biologists. The high dimensionality of functional genomics data is, however, problematic for existing matrix correlations. The motivation of this article is twofold: (i) we introduce the idea of matrix correlations to the bioinformatics community and (ii) we give an improvement of the most promising matrix correlation coefficient (the RV-coefficient) that circumvents the problems of high-dimensional data.
Results: The modified RV-coefficient can be used in high-dimensional data analysis studies as an easy measure of the information common to two datasets. This is shown by theoretical arguments, simulations and applications to two real-life examples from functional genomics, namely a transcriptomics and a metabolomics example.
Availability: The Matlab m-files of the methods presented can be downloaded from http://www.bdagroup.nl.
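The abstract does not state the formulas, so the following is only a minimal sketch under stated assumptions. It assumes the standard RV-coefficient, RV(X, Y) = tr(XX'YY') / sqrt(tr((XX')^2) tr((YY')^2)) for column-centered matrices X and Y that share the same samples as rows, and it assumes that the modification consists of setting the diagonals of XX' and YY' to zero before forming the same ratio. The function names are hypothetical, and this is not the authors' Matlab code.

```python
import math


def center_columns(m):
    """Subtract each column's mean (rows are samples, columns are variables)."""
    n = len(m)
    means = [sum(col) / n for col in zip(*m)]
    return [[v - mu for v, mu in zip(row, means)] for row in m]


def gram(m):
    """Sample-by-sample cross-product matrix M M^T."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in m] for r1 in m]


def inner(a, b):
    """Element-wise inner product; equals trace(A B) for symmetric A and B."""
    return sum(x * y for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def rv(x, y, modified=False):
    """Matrix correlation between two datasets measured on the same samples.

    With modified=True the diagonals of both cross-product matrices are
    zeroed first (assumed form of the modified RV-coefficient).
    """
    sx = gram(center_columns(x))
    sy = gram(center_columns(y))
    if modified:
        for s in (sx, sy):
            for i in range(len(s)):
                s[i][i] = 0.0
    return inner(sx, sy) / math.sqrt(inner(sx, sx) * inner(sy, sy))


# Toy data: four samples, two variables each (illustrative values only).
x = [[1.0, 2.0], [3.0, 1.0], [2.0, 5.0], [4.0, 4.0]]
y = [[2.0 * v for v in row] for row in x]            # same configuration, rescaled
z = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]  # unrelated toy dataset

rv_same = rv(x, y)
rv2_same = rv(x, y, modified=True)
rv_other = rv(x, z)
rv2_other = rv(x, z, modified=True)
```

In this sketch the two datasets may have different numbers of variables, because only the sample-by-sample cross-product matrices are compared. Datasets that differ only by a scale factor give a value of 1 (up to floating-point error) in both variants.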
Similar resources
Evaluating modularity in morphometric data: challenges with the RV coefficient and a new test measure
1. Modularity describes the case where patterns of trait covariation are unevenly dispersed across traits. Specifically, trait correlations are high and concentrated within subsets of variables (modules), but the correlations between traits across modules are relatively weaker. For morphometric data sets, hypotheses of modularity are commonly evaluated using the RV coefficient, an association st...
Pool boiling heat transfer coefficient of pure liquids using dimensional analysis
The pool boiling heat transfer coefficients of pure liquids were experimentally measured on a horizontal bar heater at atmospheric pressure. More than three hundred data points were collected at heat fluxes up to 350 kW·m⁻². Original correlations and the unique effect of these correlations on experimental data were discussed briefly. According to the analysis, a new empirica...
Calculation of One-dimensional Forward Modelling of Helicopter-borne Electromagnetic Data and a Sensitivity Matrix Using Fast Hankel Transforms
The helicopter-borne electromagnetic (HEM) frequency-domain exploration method is an airborne electromagnetic (AEM) technique that is widely used for vast and rough areas for resistivity imaging. The vast amount of digitized data flowing from the HEM method requires an efficient and accurate inversion algorithm. Generally, the inverse modelling of HEM data in the first step requires a precise a...
An Improved Correlation for Second Virial Coefficients of Pure Fluids
In the present work, a modified correlation is presented for the second virial coefficients of both polar and nonpolar fluids based on the corresponding states principle. The second virial coefficients of gaseous polar and non-polar compounds were calculated and compared with experimental data and with other correlations. Comparisons with the existing correlations show that the present work is ...
Pool Boiling Heat Transfer in Water/Amines Solutions
In this investigation, nucleate boiling heat transfer coefficients were experimentally measured during pool boiling of water/monoethanolamine and water/diethanolamine mixtures on a horizontal heating rod under atmospheric pressure. The experiments were carried out at heat fluxes below 205 kW·m⁻², over a wide range of concentrations. These experiments include measurement of pool boilin...
Journal title:
- Bioinformatics
Volume 25, Issue 3
Pages: -
Publication date: 2009